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Abstract — This paper considers the noncooperative maximiza- 
tion of mutual information in the Gaussian interference channel 
in a fully distributed fashion via game theory. This problem has 
been studied in a number of papers during the past decade for 
the case of frequency-selective channels. A variety of conditions 
guaranteeing the uniqueness of the Nash Equilibrium (NE) and 
convergence of many different distributed algorithms have been 
derived. In this paper we provide a unified view of the state-of- 
the-art results, showing that most of the techniques proposed in 
the literature to study the game, even though apparently different, 
can be unified using our recent interpretation of the waterfllling 
operator as a projection onto a proper polyhedral set. Based on 
this interpretation, we then provide a mathematical framework, 
useful to derive a unified set of sufficient conditions guaranteeing 
the uniqueness of the NE and the global convergence of water- 
filling based asynchronous distributed algorithms. 

The proposed mathematical framework is also instrumental to 
study the extension of the game to the more general MIMO case, 
for which only few results are available in the current literature. 
The resulting algorithm is, similarly to the frequency-selective 
case, an iterative asynchronous MIMO waterfllling algorithm. 
The proof of convergence hinges again on the interpretation 
of the MIMO waterfllling as a matrix projection, which is the 
natural generalization of our results obtained for the waterfllling 
mapping in the frequency-selective case. 

Index Terms — Game Theory, MIMO Gaussian interference 
channel, Nash equilibrium, totally asynchronous algorithms, 
waterfllling. 



I. Introduction 

THE interference channel is a mathematical model relevant 
to many physical communication channels and multiuser 
systems where multiple uncoordinated links share a common 
communication medium, such as digital subscriber lines 0, 
single (or multi) antenna cellular radio, ad-hoc wireless net- 
works |2|. 131. and cognitive radio systems (4). 

The interference channel is characterized by its capacity 
region, defined as the set of rates that can be simultaneously 
achieved by the users in the system while making the error 
probability arbitrary small. A pragmatic approach that leads 
to an achievable region or inner bound of the capacity region 
is to restrict the system to operate as a set of independent 
units, i.e., not allowing multiuser encoding/decoding or the 
use of interference cancellation techniques. This approach is 
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very relevant in practical systems, as it limits the amount of 
signaling among the users. With this assumption, multiuser 
interference is treated as additive colored noise and the system 
design reduces to finding the optimum covariance matrix of 
the symbols transmitted by each user. 

Within this context, in this paper we consider the maximiza- 
tion of mutual information in a fully distributed fashion using a 
game theoretical approach. Since the seminal paper of Yu et al. 
[7 1 in 2002 (and the conference version in 2001), this problem 
has been studied in a number of works during the past seven 
years for the case of SISO frequency-selective channels or, 
equivalently, a set of parallel non-interfering scalar channels 
|8|-|20|. In the cited papers, the maximization of mutual 
information is formulated as a strategic noncooperative game, 
where every SISO link is a player that competes against the 
others by choosing his power allocation (transmission strategy) 
over the frequency bins (or parallel channels) to maximize 
his own information rate (payoff function)^ Based on the 
celebrated notion of Nash Equilibrium (NE) in game theory 
(cf. 0, @), an equilibrium for the whole system is reached 
when every player's reaction is "unilaterally optimal", i.e., 
when, given the rival players' current strategies, any change 
in a player's own strategy would result in a rate loss. This 
vector-valued power control game was widely studied and 
several sufficient conditions have been derived that guarantee 
the uniqueness of the NE and the convergence of alternative 
distributed waterfllling based algorithms: synchronous sequen- 
tial (7)- Oil, synchronous simultaneous (HI, (BJ, [16), (191 , 
and asynchronous (T71 , (201 . 

Interestingly, different approaches have been used in the 
cited papers to analyze the game, most of them based on 
the following, apparently different, key results: 1) the in- 
terpretation of the waterfllling operator as a projection onto 
a proper polyhedral set (131 . (191 : 2) the interpretation of 
the Nash equilibria of the game as solutions of a proper 
affine Variational Inequality (VI) problem (121 : and 3) the 
interpretation of the waterfllling mapping as a piecewise affine 
function (TBI , (40l Ch. 4]. In this paper, we provide a unified 
view of these results, showing that they fit naturally in our 



The choice of the gender of the players is always controversial in the 
literature of game theory. This is reflected mainly on the use the third-person 
singular pronouns: some authors use "his", while others use "her", and others- 
more diplomatic ones-even use "his/her". English non-native speakers tend 
to use "its" to avoid the problem but that is not well accepted by native 
speakers. In two-player zero-sum games, the issue is even trickier and some 
authors resort to the use of one gender for the good player and another for 
the bad player. See the foreword in fj] for a related discussion on the issue. 
In this paper, for simplicity of notation and without further implications, we 
simply use "his". 
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interpretation of the waterfilling mapping as a projector [ 14], 
1 19 1 . Building on this interpretation and using classical results 
from fixed-point and contraction theory (cf. l22l . Il24l . ||25ll , 
[26 1), we then develop a mathematical framework useful to 
derive a unified set of sufficient conditions guaranteeing both 
the uniqueness of the NE and the convergence of totally 
asynchronous iterative waterfilling based algorithms. 

The proposed mathematical framework is instrumental to 
study the more general MIMO case, which is a nontrivial 
extension of the SISO frequency-selective case. There are 
indeed only a few papers that have studied (special cases of) 
the MIMO game EH, (23- EQ. In (23, the authors focused 
on the two-user MISO channel. In ED-ED, the authors 
considered the rate maximization game in MIMO interference 
channels, but they provided only numerical results to support 
the existence of a NE of the game. Furthermore, in these 
papers there is no study of the uniqueness of the equilibrium 
and convergence of the proposed algorithms. Finally, in IBTI , 
the authors showed that the MIMO rate maximization game is 
a concave game (in the sense of [32]), implying the existence 
of a NE for any set of arbitrary channel matrices [ 32 , Theorem 
1]. As far as the uniqueness of the equilibrium is concerned, 
they only showed that if the multiuser interference is almost 
negligible, then the NE is unique, without quantifying how 
small the interference must be. Hence, a practical condition 
that one can check to guarantee the uniqueness of the NE of the 
game and convergence of distributed algorithms is currently 
missing. 

The main difficulty in the MIMO case is that the optimal 
transmit directions (i.e., eigenvectors of the transmit covari- 
ance matrix) of each user change with the strategies of the 
other users, as opposed to the SISO frequency-selective case 
where only the power allocation depends on the strategies of 
the others, but the directions remain fixed: i) in the diagonal 
MIMO case, the transmit directions are always the canonical 
vectors iTTl- lfMl : ii) in the frequency-selective channel, the 
transmit directions are the Fourier vectors lTT8ll . fl9l ; iii) for the 
MISO case, the transmit directions are matched to the vector 
channels; and iv) for the SIMO case, there are no transmit 
directions to optimize. For the previous reason, the existing 
results and techniques in [7|-[20|, valid for SISO frequency- 
selective channels, cannot be applied or trivially extended 
to the MIMO case. On top of that, another difficulty is the 
fact that, differently from the vector power control game in 
|7|-|19|, where there exists an explicit relationship (via the 
waterfilling solution) among the optimal power allocations of 
all the users, in the matrix-valued MIMO game one cannot 
obtain an explicit expression of the optimal covariance matrix 
of each user at the NE (the MIMO waterfilling solution), as a 
function of the optimal covariance matrices of the other users, 
but there exists only a complicated implicit relationship, via 
an eigedecomposition. 

Building on the mathematical framework developed for the 
SISO case, we can overcome the main difficulties in the 
study of the MIMO game invoking a novel interpretation 
of the MIMO waterfilling operator as a projector and its 
nonexpansion property, similar to the one for frequency- 
selective channels. This enables us to derive a unified set 



of sufficient conditions that guarantee the uniqueness of the 
Nash equilibrium of the MIMO game and the convergence 
of totally asynchronous distributed algorithms based on the 
MIMO waterfilling solution. 

The paper is organized as follows. Section [TT] gives the 
system model and formulates the optimization problem as a 
strategic noncooperative game. In Section [III] we draw the 
relationship between Nash equilibria of the game and fixed 
points of nonlinear sets of equations, and provide the math- 
ematical tools necessary to study convergence of distributed 
asynchronous algorithms. Building on the interpretation of the 
multiuser waterfilling solution as a proper projection onto a 
convex set, in Section IIV1 we provide the main properties 
of the multiuser waterfilling solution either in the SISO or 
MIMO case, unifying previous results proposed in the litera- 
ture to study the rate maximization game in SISO frequency- 
selective channels. The contraction property of the multiuser 
waterfilling paves the way to derive sufficient conditions guar- 
anteeing the uniqueness of the fixed point of the waterfilling 
projector— alias the NE of the (SISO/MIMO) game-and the 
convergence of iterative, possibly asynchronous, distributed 
algorithms, as detailed in Sections [V] and I VII respectively. 
Section [VTIl reports some numerical results illustrating the ben- 
efits of MIMO transceivers in the multiuser context. Finally, 
Section [Villi draws some conclusions. 

II. System Model and Problem Formulation 

In this section we introduce the system model and formulate 
the optimization problem addressed in the paper explicitly. 

A. System Model 

We consider a vector Gaussian interference channel com- 
posed of Q links. In this model, there are Q transmitter- 
receiver pairs, where each transmitter wants to communicate 
with its corresponding receiver over a MIMO channel. The 
transmission over the generic q-th MIMO channel with nr q 
transmit and tir receive dimensions can be described with 
the baseband signal model 

y g = HggXg + ^ K rq X r + Ilg, (1) 

r=£q 

where x g <E C nTq is the vector transmitted by source 
g, H„eC" s ' x " T ' is the direct channel of link q, 
H rg S C nRqXnTr is the cross-channel matrix between source 
r and destination q, y g e C 7lRq is the vector received by desti- 
nation q, and n q G C nRq is a zero-mean circularly symmetric 
complex Gaussian noise vector with arbitrary covariance ma- 
trix R„ (assumed to be nonsingular). The second term on the 
right-hand side of (Q]) represents the Multi-User Interference 
(MUI) received by the q-th destination and caused by the other 
active links. For each transmitter q, the total average transmit 
power is 

£{\\x q \\ 2 2 }=Tr(Q q )<P q , (2) 

where Tr(-) denotes the trace operator, Q 9 = £{x 9 x^} is 
the covariance matrix of the transmitted vector x g , and P q is 
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the maximum average transmitted power in units of energy 
per transmission. 

The system model in (fl}-© provides a unified way to rep- 
resent many physical communication channels and multiuser 
systems of practical interest. What changes from one system 
to the other is the structure of the channel matrices. We may 
have, in fact, as particular cases of (Q}-(|2]i: i) digital subscriber 
lines [1], where the channel matrices are Toeplitz circulant, 
the matrices Q 9 = W Diag(p g )W ff incorporate the DFT 
precoding W ff , the vectors p q allocate the power across the 
frequency bins, and the MUI is mainly caused by near-end 
cross talk; ii) single (or multi) antenna CDMA cellular radio 
systems, where the matrices Q q = F q F^ contain in F q the 
user codes within a given cell, and the MUI is essentially 
intercell interference Q; iii) ad-hoc wireless MIMO networks, 
where the channel matrices represent the MIMO channel of 
each link 0. 

Since our goal is to find distributed algorithms that do 
not require neither a centralized control nor a coordination 
among the links, we focus on transmission techniques where 
no interference cancellation is performed and multiuser inter- 
ference is treated as additive colored noise from each receiver. 
Each channel is assumed to change sufficiently slowly to be 
considered fixed during the whole transmission, so that the 
information theoretical results are meaningful. Moreover, per- 
fect channel state information at both transmitter and receiver 
sides of each link is assumed]! eacn receiver is also assumed 
to measure with no errors the covariance matrix of the noise 
plus MUI generated by the other users. Finally, we assume that 
the channel matrices H qq are square nonsingular. The more 
general case of possibly rectangular nonfull rank matrices is 
addressed in ED . 

Under these assumptions, invoking the capacity expression 
for the single user Gaussian MIMO channel— achievable us- 
ing random Gaussian codes by all the users— the maximum 
information rate on link q for a given set of users' covariance 
matrices Qi, . . . , Qq is [33] 

R q (Q q ,Q- q ) = logdet(l + Hf g R: 9 (Q_ 9 )H OT Q 9 ) (3) 

where R_ g (Q_ g ) ^ R ?lq + £ H rg Q r Hf 9 is the MUI plus 

noise covariance matrix observed by user q, and Q 9 = 

(Qr)^Li r =jtq m tne set °f a ^ ^ e users ' covariance matrices, 
except the g-th one. 

B. Game Theoretical Formulation 

We formulate the system design within the framework of 
game theory using as desirable criterion the concept of Nash 
Equilibrium (NE) (cf. 0, J6)). Specifically, we consider 
a strategic noncooperative game, in which the players are 
the links and the payoff functions are the information rates 
on each link: Each player q competes against the others by 
choosing his transmit covariance matrix Q q (i.e., his strategy) 
that maximizes his own information rate R q (Q q , Q-g) in ©, 
subject to the transmit power constraint ©. A solution of the 

2 Note that each user q is only required to known his own channel H q(3 , 
but not the cross-channels {H r9 } r ^ 9 from the other users. 



game— a NE— is reached when each user, given the strategy 
profiles of the others, does not get any rate increase by 
unilaterally changing his own strategy. Stated in mathematical 
terms, the game has the following structure: 

maximize -R (QoiQ-o) 
(Sf): Q« Vqen, 

subject to Q 9 e J2 q , 

(4) 

where 57 = {1, . . . , Q} is the set of players (i.e., the links); 
Rq(Qq> Q-g) defined in © is the payoff function of player 
q; and £} q is the set of admissible strategies (the covariance 
matrices) for player q, defined as^] 

£ q = {Q G C ,lT * x " T « : Q h 0, Tr{Q} = P g }. (5) 

The solutions of game ^ are formally defined as followsQ 
Definition 1: A (pure) strategy profile Q* = (Q 9 ) en E 
J3i x . . . x J2q is a NE of game <3 if 

R q (Q q , Q%) > R q (Q q ,Q- q ), VQ 9 e £ q , Vq e n. (6) 

□ 

To write the Nash equilibria of game ^ in a convenient 
form, we first introduce the following notations and defini- 
tions. Given Sf, for each q £ ft and Q_,j S = J2\ x . . . x 
J2 q -i, x ... x £}q, we write the eigendecomposition of 
Hf g Rl 9 (Q_ g )H 9g as: 

Hf 9 RZ g (Q_ 9 )H 99 =U 9 D 9 Uf, (7) 

where U 9 = U 9 (Q_ g ) e C nT i xnT " is a unitary matrix with 
the eigenvectors, D q = D 9 (Q_ g ) e R++ 9 is a diagonal 
matrix with the riT q positive eigenvalues, and R_ 9 (Q_ 9 ) = 
R-n, + J2 H rg Q r H r9 . 

Given q 6 fl and Q g 6 &- q , the solution to problem (O 
is the well-known waterfilling solution (e.g., [33]): 

q; = wf,(q4 (8) 

with the waterfilling operator WF, (•) defined as 

WFjQ^U^I-D-yuf, (9) 

where U q = XJ q (Q^ q ), D q = D g (Q_ g ), and \i is chosen to 
satisfy Tr {(p q I - T> q 1 )+) = P q , with (x)+ = max(0,x). 

Using (© and Definition [T] we can now characterize the 
Nash Equilibria of the game in a compact way as the 
following waterfilling fixed-point equation: 

Q* = WF 9 (Q* g ) , Vqen. (10) 

Remark 1 - Competitive maximization of transmission rates. 
The choice of the objective function as in (© requires the 

^Observe that, in the definition of «S 9 in j5J the condition Q = is 
redundant, since any complex positive semidefinite matrix must be necessarily 
Hermitian 1411 Sec. 7.1]. Furthermore, we replaced, without loss of generality 
(w.l.o.g.), the original inequality power constraint in J2) with equality, since, 
at the optimum to each problem in (4)> the constraint must be satisfied with 
equality. 

4 Observe that, for the payoff functions defined in ^3), we can indeed limit 
ourselves to adopt pure strategies w.l.o.g., as we did in {5), since every NE 
of the game can be proved to be achievable using pure strategies 1 18 |. 
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use of ideal Gaussian codebooks with a proper covariance 
matrix. In practice, Gaussian codes may be substituted with 
simple (suboptimal) finite-order signal constellations, such as 
Quadrature Amplitude Modulation (QAM) or Pulse Ampli- 
tude Modulation (PAM), and practical (suboptimal) coding 
schemes. In this case, instead of considering the maximization 
of mutual information on each link, one can focus on the 
competitive maximization of the transmission rate, using finite 
order constellation, under constraints on transmit power and on 
the average error probability P* q (see fT8| for more details). 
Interestingly, using a similar approach to that in [18|, one can 
prove that the optimal transmission strategy of each user is 
still a solution to the fixed-point equation in ([Tol l, where each 
channel matrix H qq is replaced by H qq /T q , where T q > 1 
denotes the gap, which depends only on the constellations and 
on the target error probability P* q [34]. 

Remark 2 - Related works. The matrix nature of game 5f 
and the arbitrary structure of the channel matrices make the 
analysis of the game quite complicated, since none of the 
results in game theory literature can be directly applied to 
characterize solutions of the form ( TTOb . The main difficulty 
in the analysis comes from the fact that, for each user q, the 
optimal eigenvector matrix U* = U 9 (Q* ) in (O depends, in 
general, on the strategies Q* of all the others, through a very 
complicated implicit relationship— the eigendecomposition of 
the equivalent channel matrix H^RlJ(Q*)H gg . 
In the vector power control games studied in [7|-[20|, the 
analysis of uniqueness of the equilibrium was mathematically 
more tractable, since scalar frequency-selective channels are 
represented by diagonal matrices (or Toeplitz and circulant 
matrices fill . [19]), implying that the optimal set of eigenvec- 
tors of any NE becomes user-independent [18|. In the present 
case, it follows that, because of the dependence of the optimal 
strategy U*(Q* ) of each user on the strategy profile of the 
others at the NE, one cannot use the uniqueness condition of 
the NE obtained in [7|-[20| to guarantee the uniqueness of the 
NE of game Sf in ©, even if game £f reduces to the power 
control game studied in the cited papers, once the optimal 
users' strategy profile in ( TTOb is introduced in ®. At the best 
of our knowledge, the only paper where game S? was partially 
analyzed is 11311 . where the authors applied the framework 
developed in ll32l to the MIMO game and showed that the NE 
becomes unique if the MUI in the system— the interference-to- 
noise ratio at each receiver— is sufficiently small, but without 
quantifying exactly how much small the MUI must be. Thus, 
a practical condition that one can check to guarantee the 
uniqueness of the NE is still missing. 

To overcome the difficulties in the study of game Sf , we pro- 
pose next an equivalent expression of the waterfilling solution 
enabling us to express the Nash equilibria in (TlOb as a fixed- 
point of a more tractable mapping. This alternative expression 
is based on the new interpretation of the MIMO waterfilling 
solution as a proper projector operator. Based on this result, we 
can then derive sufficient conditions for the uniqueness of the 
NE and convergence of asynchronous distributed algorithms, 
as detailed in Sections [V] and |VT] respectively. 



III. Nash Equilibrium as a Fixed Point 

Before providing one of the major results of the paper— the 
contraction properties of the MIMO multiuser waterfilling 
projector— we recall and unify some standard results from 
fixed-point [25 1 and contraction theory |22l . Il24l that will be 
instrumental for our derivations (recall from (TlOb that any NE 
can be interpreted as a fixed point of the waterfilling mapping). 
The proposed unified mathematical framework is also useful to 
establish an interesting link among the alternative, apparently 
different, approaches proposed in the literature to study the rate 
maximization game in SISO frequency-selective interference 
channels II71- I1T41 . showing that most of the results in [7|-[14| 
can be unified by our interpretation of the waterfilling as a 
projector 0U, 03- 

A. Existence and Uniqueness of a Fixed-point 

Let T : X i— > X be any mapping from a subset X C W 1 to 
itself. One can associate T to a dynamical system described 
by the following discrete-time equation: 



x(ra + l) = T(x(n)), n e N+, 



(11) 



where x(n) G M™ is the vector of the state variables of the 
system at (discrete) time n, with x(0) = xo G K™. The 
equilibria of the system, if they exist, are the vectors x* 
resulting as a solution of x* = T(x*), i.e., the fixed-points 
of mapping T. The study of the existence and uniqueness 
of an equilibrium of a dynamical system has been widely 
addressed either in fixed-point theory (cf. [22|, [24|, 11251 ) or 
control theory (cf. [26], [35|, |36|) literature. Many alternative 
conditions are available. Throughout this paper, we will use 
the following. 

Theorem 1: Given the dynamical system in ([77} with T : 
X i— > X and X C R™, we have the following: 

a) Existence (cf. 11241 . [25]): If X is nonempty, convex and 
compact, and T is a continuous mapping, then there exists 
some x* such that x* = T(x*); 

b) Uniqueness (cf. (22), J24), EH ): If X is closed and T is a 
contraction in some vector norm ||-|| , with modulus a G [0, 1), 
i.e., 



T(x (1) ) -T(x 



{2y 



< a 



,(2) 



Vx 



(1) Y (2) 



G X, 
(12) 

□ 



then the fixed-point of T is unique. 

Remark 3 - Sufficiency of the conditions. The conditions of 
Theorem[T]are only sufficient for the existence and uniqueness 
of the fixed point. However, this does not mean that some of 
them can be removed. For example, the convexity assumption 
in the existence condition cannot, in general, be removed, as 
the simple one-dimensional example T(x) = —x and X = 
{— c, c} , with cgi, shows. 

Remark 4 - Choice of the norm. The contractive property of 
the mapping is norm- dependent, in the sense that a mapping 
may be contractive for some choice of the norm on M™ 
and, at the same time, it may fail to be so under a different 
norm. On the other hand, it may happen that a mapping is a 
contraction in more than one norm. In such a case, even though 
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the uniqueness of the fixed-point is guaranteed whatever the 
choice of the norm is (cf. Theorem [T), the convergence of 
different algorithms, based on the same mapping T, to the 
fixed-point is, in general, norm-dependent. Thus, the choice 
of the proper norm is a critical issue and it actually gives us 
potential degrees of freedom to be explored in the characteri- 
zation of the convergence properties of the desired algorithms 
used to reach the fixed-point. We address this issue in the next 
sections, where we introduce a proper norm, tailored to our 
needs. 

B. Convergence to a Fixed-point 

Nonlinear fixed-point problems are typically solved by itera- 
tive methods, especially when one is interested in distributed 
algorithms 11221 . Il24ll . In fact, the mapping x h- ► T(x) can be 
interpreted as an algorithm for finding such a fixed point. The 
degrees of freedom are in the choice of the specific updating 
scheme among the components of vector x, based on mapping 
T. More specifically, denoting by x = (x x , . . . , Xq) a partition 
of x, with x q E R™ 9 and n\ + . . . + tiq — n, and assuming 
X = X\ X • • • X Aq0 with each X q C E™" , the most common 
updating strategies for Xx , . . . , Xq based on mapping T are 
||22), ||24): 

i) Jacobi scheme: All components xi , . . . , xq are up- 
dated simultaneously, via the mapping T; 

ii) Gauss-Seidel scheme: All components xi , . . . , xq 
are updated sequentially, one after the other, via the 
mapping T; 

iii) Totally asynchronous scheme: All components 
xi , . . . , xq are updated in a totally asynchronous 
way (in the sense of |22]), via the mapping T. Ac- 
cording to this scheme, some components x g may be 
updated more frequently than others and, when they 
are updated, a possibly outdated information on the 
other components can be used. Some variations of 
such a totally asynchronous scheme, e.g., including 
constraints on the maximum tolerable delay in the 
updating and on the use of the outdated information 
(which leads to the so-called partially asynchronous 
algorithms), can also be considered l22l . 

Observe that the latter algorithm contains, as special cases, the 
first two ones. In general, the above algorithms converge to the 
fixed-point of T under different conditions E2} . Il24l . How- 
ever, we can obtain a unified set of convergence conditions 
(not necessarily the mildest ones) by studying the contraction 
properties of mapping T under a proper choice of the norm. To 
prove the convergence of the totally asynchronous algorithms, 
a useful norm is the so-called block-maximum norm, defined 
as follows. According to the partition xi, . . . ,xq of x and 
T = (T g )^ =1 , with T q : X i-> X q , let ||-|| ? denote any vector 
norm on M. nq for each q, the block-maximum norm on K" is 
defined as fl22), ||24l 

l|T|| block = max||T 9 || 9 . (13) 

5 For the sake of simplicity, we focus on mappings T whose domain can 
be written as the Cartesian product of lower dimensional sets, associated to 
the partition of the mapping. For our purposes, this choice is enough, since 
the joint admissible strategy set of game '$ satisfies this condition. 



The mapping T is said to be a block-contraction with modulus 
a € [0, 1) if it is a contraction in the block-maximum norm 
with modulus a. A unified set of convergence conditions for 
distributed algorithms based on mapping T is given in the 
following theorem, whose proof follows the same steps as in 
ll20l Appendix A] and is omitted here because of the space 
limitation (see also ED). 

Theorem 2: Given the mapping T = (T q )® =1 '■ X 1— > X, 
with X = X\ x • • ■ x Xq, assume that T is a block-contraction 
with modulus a G [0,1). Then, the totally asynchronous 
algorithm {cf. [22]) based on the mapping T asymptotically 
converges to the unique fixed-point of T, for any set of initial 
conditions in X and updating schedule. □ 

Theorem [2] provides a unified set of convergence conditions 
for all the algorithms that are special cases of the totally asyn- 
chronous algorithm. Weaker conditions can still be obtained if 
one is not interested in a totally asynchronous implementation. 
For example, if only the Jacobi updating scheme is considered, 
to prove the contraction property of T, one can use any 
arbitrary norm on W 1 l22l. 1251. 

C. Contraction Theory, Lyapunov Function, and Variational 
Inequality Problems 

Contraction theory is not the only instrument available 
to prove the convergence of distributed algorithms to the 
fixed-point of a mapping T. So far, we have seen that any 
mapping T defines a dynamical system (see (fTTl) ). Hence, the 
convergence of distributed algorithms to the fixed-point of T 
can be reformulated as the study of the globally asymptotic 
stability of the equilibrium of a proper dynamical system, 
based on T. From this perspective, Lyapunov theory is a 
valuable instrument to study the system behavior and, as a 
by-product, the convergence of distributed algorithms l26l . 
[ 3 5 1 . Indeed, the contraction property of mapping T in the 
vector norm ||-|| implies the existence of a valid Lyapunov 
function for the dynamical system in ( TTTI) ll26ll . 11351 . given 
by V(x) = ||x— x*| . This guarantees the convergence of a 
Jacobi scheme based on mapping T. Interestingly, in the case 
of contraction of mapping T in the block maximum norm ( TT3l . 
the Lyapunov function V(x) = ||x— x*|| block can be interpreted 
as the common Lyapunov function of a set of interconnected 
dynamical systems, each of them associated to the partition 
xi, . . . ,xq of x J36). 

Finally, it is interesting to observe that the convergence to 
the fixed-point of a mapping T can also be studied introducing 
a proper transformation of T that preserves the set of the fixed- 
points. A useful tool to explore this direction is given by the 
variational inequality theory (39), ll40l (see, e.g., lTT2l . [19| 
and Section IIV-AI for an application of this framework to the 
multiuser waterfilling mapping). 

We are now ready to apply the previous general framework 
to the multiuser waterfilling mapping in (0, as detailed next. 

IV. Contraction Properties of the Multiuser 
Waterfilling Mapping 

So far we have seen that a unified set of conditions 
guaranteeing the uniqueness of the NE and the convergence 
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of totally asynchronous algorithms to the fixed-point Nash 
equilibria of game Sf can be obtained deriving conditions for 
the multiuser waterfilling mapping in (O to be a contraction 
in a proper block-maximum norm (see Theorems Q] and [2]). 
In this section we then provide a contraction theorem for 
the multiuser waterfilling operator. Our result is based on 
the interpretation of MIMO waterfilling operator as a matrix 
projection onto the convex set of feasible strategies of the 
users. This result is also useful to obtain a unified view of, 
apparently different, techniques used in the literature to study 
the uniqueness of the NE and the convergence of alternative 
waterfilling based algorithms in the rate-maximization game 
over SISO frequency-selective Gaussian interference channels 
Q-J20). To show this interesting relationship, we start from 
an overview of the main properties of the Nash equilibria of 
game in the case of SISO frequency-selective Gaussian 
interference channels, as obtained in lfT4l . Ifl9l , 11201 , lfT2ll . 
iPHfl , and Q, |[T31 , and then we consider the more general 
MIMO case. 

A. Frequency-Selective Gaussian Interference Channels 

In the case of block transmission over SISO frequency- 
selective channels [7|-[20|, each channel matrix H rq S C NxN 
becomes a Toeplitz circulant matrix and tit = ur = JV, 
where N is the length of the transmitted block (see, e.g., [ 18 1). 
This leads to the eigendecomposition H rg = WD r ,W fl , 
where WeC M is the normalized IFFT matrix, i.e., 
[W] y = e*2*(*-i)W-i)/w /y/N for i,j = 1, . . . , N and T> rq 
is a N x N diagonal matrix, where [D r(J ] fcfc = H rq (k) is the 
frequency-response of the channel between source r and des- 
tination q at carrier k, and R Jl? = Diag(cr^(l), . . . ,a q (N)). 
Under this setup, the matrix game in reduces to a simpler 
power control game, where the strategy of each user q becomes 
the power allocation p q = \p q (l), . . . ,p q (N)] T over the N 
carriers and the admissible strategy set in Q reduces tc@ 



x e 



N 
k=l 



Pa 



Vk€{l,...,N} 



(14) 

and & = x . . . x &q. It follows that the optimal strategy 
at any NE must satisfy the simultaneous multiuser waterfilling 
equation: 

p* = wf,(p* g ), Vqen, (15) 

where p q = {p r ) ren r ^ q , and the waterfilling operator 
wf 9 (•) becomes lfl9l 



[wf, (p- q )] k = H q 



V 2 q (k)+Er^\Hr q (k)\ 2 p r (ky 

\H qq (kW 



(16) 

for k € {1, . . . , N}, with the waterlevel /i ? chosen to satisfy 
the power constraint J2k=iPg(^) = Pq- 

6 In the transmissions over frequency-selective channels, in addition to the 
power constraints as in U4t . it may be useful to introduce spectral mask 
constraints, in order to impose radiation limits over some (licensed) bands 
1121 . [191 , [201 , 1141 . Interestingly, all the results presented on this section 
are valid also in the presence of spectral mask constraints, as proved in 1 12], 

El, El, (H. 



Different approaches have been proposed in the literature 
to study the properties of the Nash equilibria in ( fT3T >, each 
time obtaining milder conditions for the uniqueness and the 
convergence of distributed algorithms [7|-[20|. We provide in 
the following a unified view of the techniques used in the 
cited papers, based on the mathematical framework described 
in Section Hill 

Approach #1: Multiuser waterfilling as a projector |14|, 

1 19 1 - We introduce first the following intermediate definitions. 
For any given p- 9 , let insr,j(p_ 9 ) be the TV-dimensional 
vector defined as 

, , „ a ^(fc) + E r ^l^(fc)|V(fc) 
[' nsr «(p-^ = j^p • < 17 > 

with k 6 {1,...,7V}. In order to apply Theorem [2] we 
introduce a proper block-maximum norm for the multiuser 
waterfilling mapping wf = (wf«j(p_ g )) 9e n in ( fTBI l (cf. 



Section IIII-Bb . Given some w = [t 



,wq] t > 0, let 



II "II 2 block denote the (vector) block-maximum norm, defined 
as 1221 



|wf(p) 



A 

= max 



l wf ,(P-9)| 



where 



is the Euclidean norm. Let 



(18) 

be the vector 



weighted maximum norm, defined as [41 1 



and let 



given 



by EU 



= max^, w > 0, xeF 
,ec gen w q 

denote the matrix norm induced by 



(19) 



1 Q 

!l A Cmat = m ax— £|[A], r | Wr, AG 



(20) 



We also introduce the nonnegative matrix S max € 
defined as 

\H qr (k)\ 2 



max 



J qr 



ke{i,...N} \H qq (k)\^ 
0, 



if r^g, 



(21) 



otherwise. 



sc 



Using the above definitions and denoting by [xo] 
argmin zg jr ||z— xo|| 2 the Euclidean projection of vector Xo 
onto the convex set 5tf , in lfl4l . Ifl9l we proved the following. 



Lemma 1 (Waterfilling as a projector): The waterfilling 
operator wf g (p_ g ) in d 1 6b can be equivalently written as 

wf, (p-,) = [-insr g (p_,)]^ , (22) 

where and insr g (-) are defined in (fl4l and ( Tf7T l. respec- 
tively. □ 
It follows from Lemma Q] that the Nash equilibria in ( TTBT ) 
can be alternatively obtained as the fixed-points of the mapping 
defined in (1221) 



p\ = [-insr^p*^]^ , Vqen. 



(23) 



Lemma [T] is also the key result to study contraction properties 
of mapping wf and thus, based on ( f23l , to derive conditions 
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for the uniqueness of the NE and convergence of distributed 
algorithms. The main result is summarized in the following 
theorem that comes from flT9l Proposition 2]Q 

Theorem 3 ( Contraction property of the mapping wf): 
Given w = [w\, . . . ,wq] t > 0, the mapping wf defined in 
dT6b is Lipschitz continuous on 3?: 

wf( P (1) )- wf( P (2) ; 



< 



2, block 



omax 1 1 w 

I loo, mat 

< llpM -p( 2 ) 



VpM, p (2) g @> t w / iere 



2. block ' 



2, block ' 

(24) 

, and S max are 



defined in d 1 8t > . d20t . ant/ ( 12 U . respectively. Furthermore, if 



3 max 1 1 w 



< 1, 



(25) 



f/zetf mapping wf is a block-contraction with modulus 

l|S max ||™ mal . □ 

Given Theorem [3] it follows from Theorem [T] and Theorem 
|2j that condition ||S max ||™ mat < 1 is sufficient to guaran- 
tee the uniqueness of the NE of the game as well as the 
convergence of totally asynchronous algorithm based on the 
waterfilling mapping wf in ( fT6T > [19|, |20|. 



Approach #2: Multiuser waterfilling as solution of an 
Affine VI lfT2l - In 1121 . the authors established an interesting 
reformulation of the rate maximization game as a linear 
complementarity problem (LCP) [39|. More specifically, they 
proved that the nonlinear system of KKT optimality conditions 
of the g-th user convex problem in Sf, given by [where 
a _L b means that the two scalars a and b are orthogonal, 
i.e., a ■ b = ofl 



\H qq (k)f 



+ M 9 ~ V q.k = 0, 



Vfc, 



0<M 9 ^ (P q -EtiPi(k)) >o, 

0<u q , k ±p q (k)>0, Vfc, 

(26) 

is equivalent to [12, Proposition 1]: 



< p,(fc) 



l^ 9 (fc)| 2 



ltf w (fc)l 2 



Vfc, 



A„ 



free < J2k=iPi( k ) = p i- 

(27) 

As observed in [12|, (l27b for all q E ft represents 
the KKT conditions of the Affine VI (AVI) ct,M) 
defined by the polyhedral set 3? and the affine map- 
ping p i— > <x + Mp (see [ 39 1 for more details on 
the AVI problems), where p = [pj , . . . , Pq] t , <x = 



o-q] t , and M is a block partition matrix M 



[M 



with p„ 



[ Pq (l),..., Pq (N)Y 



[al(l)/\H qq (l)\\...,at(N) /\H qq (N)\ 2 f and M 9r 4 

7 We refer the interested reader to [19|, [20] for mode general results on 
contraction properties of the wf mapping. 

8 Observe that in [12], the authors considered equivalently for each user q € 
Q the power constraint ^2^—iPq(k) < Pq, rather than ^2^—iPq(k) = P q , 
as we did in 114) . 



Diag (\H rq (l)\*/\H qq (l)\*, \H rq (N)\y\H qq (N)f), for 
r, qett. 

It follows that the vector p* E 3? is a NE of the game 
if and only if it satisfies the AVI a,M) (39), EO): 



(p-p*) j (ar + Mp*) >o, v p g 



(28) 



Building on this result and the properties of AVI problems 
(cf. [39], 1 40]), the authors in [12] derived sufficient conditions 
for the uniqueness of the Nash equilibrium and the global 
convergence of synchronous sequential IWFA. 

It can be show that the AVI (J 2 , <r, M) in d28l i is equivalent 
to the fixed-point equation in ( 1231 [40|, |39|, establishing the 
link between the solutions to (|28T> — the fixed-points of the 
waterfilling mapping wf in (TTST)— and the interpretation of the 
mapping wf as a projection (Lemma [TJ, as given in fl4l . 
|190 In fact, the convergence conditions obtained in [12| for 
the synchronous sequential IWFA coincide with d25b (for a 
proper choice of vector w |19])0 Observe that they are a 
special case of those obtained in [20, Corollary 2]. 



Approach #3: Multiuser waterfilling as a piecewise affine 
function |16|, [40, Ch. 4] - In JT6), the authors proved global 
convergence of different waterfilling based algorithms building 
on the key result that the waterfilling mapping wf : B 21 1— ► B 2 * 
can be equivalently written as a piecewise affine (PA) function 
on R QN [40 Ch. 4]. In fact, the result in d follows the 
same steps as Propositions 4.1.1 and 4.2.2 in ]|40l Ch. 4] and, 
interestingly, can be obtained directly from our interpretation 
of the wf as a projection (Lemma Q]) and some properties of 
the PA functions in j40l Ch. 4], as detailed next. We introduce 
the following intermediate definitions first. 

Definition 2 ( 11401 Def. 4.1.3]): A continuous function f : R™ >— > 
R m is said to be piecewise affine (PA) if there exists a finite 
family of affine functions {ffc(x) = A/jX + ^Ok\k=i f or some 
positive integer K and {(A-k>^>k)}k—v with each ffc : R" i— > 
R"\ such that for all x G R™, f(x) € {fi(x), . . . , f A '(x)}. □ 

PA functions have many interesting properties (we refer 
the interested reader to l40l Ch. 4] for an in-depth study of 
the theory of PA functions). Here, we are interested in the 
following one, which follows directly from ll40l Proposition 
4.2.2 (c)]. 

Lemma 2: Any PA map f : R" ^ R m is globally Lipschitz 
continuous on R": 



f(x 



f(x (2) ) 



; a 



(1) _ x (2) 

Vx (1) , x (2) € R", (29) 



with Lipschitz constant a = maxj. 6 { lj ,.. t K} \\A-k\\m-n, where 
|j-|j vec is any vector norm and |j-|| mat is the matrix norm 
induced by ||-|| ve0 . □ 

The link between our interpretation of the wf mapping as a 
projector (Lemma Q]) and the interpretation of wf as PA map 
( ifTBI Theorem 5]) is given by the following ll40l Prop. 4.1.4]. 

'Papers 1181 , 1191 have been originally submitted on September 2004. 
'"Recall that condition )25t is valid also for the convergence of the 
asynchronous IWFA (Theorem ff) 1201 . 
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Lemma 3: Let X be a polyhedral set in R n F*1 Then, the 
Euclidean projector onto is a PA function on R n . □ 

According to Lemma [T] for any given p > 0, the wa- 
terfilling wf(p) in ( TT6T > is the Euclidean projector of vector 
-insr(p) = -[insr^p^x) 71 , . . . ,insr Q (p_ Q ) T ] T onto the 
convex set which is a polyhedral set. It then follows from 
Lemma 3 that wf (p) is a PA function, i.e., there exists a finite 
family of affine functions {ffe(p) = AkP + hk}^ =1 such that, 
for every p > we have 



wf(p) = A k p + b h 



for some k e {1, . . .K}, (30) 



which coincides with the result in lfl6l Theorem 5]. The 
expression of the affine pieces {(Ak,hk)}^ =1 of wf(p) can 
be obtained exploring the structure of the waterfilling solution 
(I16t . We omit the details here because of space limitations 
(see ([16, Theorem 5]). 

Contraction properties of the wf operator interpreted as 
PA map on SP N as stated in |[T6l Theorem 7] are a direct 
consequence of properties of the PA functions. In particular, 
it follows from Lemma [2] that, if 



max 
k<={l,...,K] 



<1, 



(31) 



then the wf mapping is a contraction in the vector norm || • 
||vec- Exploring different vector norms, one can easily obtain 
different sufficient conditions for the uniqueness of the NE of 
the game and the convergence of distributed algorithms, based 
on the wf mapping. Observe that, according to Theorem[2] if a 
block-maximum norm is used in d29l > (see also Theorem [3] and 
Theorem [5] in Section IIV-B21 >, then condition OTb guarantees 
also the global convergence of totally asynchronous algorithms 
ED, El- 
Approach #4: Multiuser waterfilling via the max-lemma 
[71, [15 1 - In [15 1, the authors, among all, derived conditions 
for the global convergence of the synchronous sequential and 
simultaneous iterative waterfilling algorithms (implying also 
the uniqueness of the NE). Conditions in IPT51 generalize 
those obtained in Q for the two-player game to the case 
of arbitrary number of players. We show now that results in 
[fl~5l come from Theorem [2] as special case, if a proper vector 
norm is chosen. To this end, we introduce first the following 
intermediate definitions and results. 

A key point in the proof of convergence in [ 15] is given by 
the following max-lemma ||T5l Lemma 1]. 

Lemma 4: Let / : I h I and g : R i— > M be a non- 
decreasing and non-increasing functions on R, respectively. 
If there exists a unique x* such that f(x*) — g(x*), and 
function f and g are strictly increasing and strictly decreasing 
at x — x* , then 



argmin{max{/(x), g{x)}}. 



(32) 
□ 



"Roughly speaking, a polyhedral is the intersection of a finite number of 
halfspaces and hyperplanes (see, e.g., 1421 Ch. 2.2.4]). 



Let us introduce the following error dynamic, as defined in 



|15| 



e q (n + 1) = max • 



[p^ +i \k)-p { :\k) 



(33) 



with 



where p q n ' 



: N+ = {0,1,2,...}, 

,p q (N)] T is the vector of the power allocation 
of user q G f2, generated at the discrete time n by the sequen- 
tial or simultaneous IWFA [starting from any arbitrary feasible 



point p(°)], 



max(0,x), and [x] = max(0, — x). 



Using Lemma [4] and following the same steps as in [15| 
results in [15| can be restated in terms of the error vector 



s (n+l) A rJ^+i) 



,(«+!) 



(n+l)l 

3 Q 



as: 



< a 



where a = (Q — 1) max*,, r ^ q 



\H rq W 



Vn G PL 



, and 



(34) 



the loo norm [see jl9i with w = 1]. It follows from ( 134b that, 
under a < 1, both synchronous sequential and simultaneous 
IWFAs globally converge to the unique NE of the game |f71 , 
(15). Observe that this condition implies the contraction of 
the waterfilling mapping as given in Theorem [3] showing the 
more generality of our sufficient condition ( |25] | than that in 

fa, ma. 

Extension: We generalize now the results in Q, lfT31 . so that 
we can use Theorem [2] and enlarge the convergence conditions 
of 0, 03), making them to coincide with d25l l and valid also 
for the asynchronous IWFA [20|. To this end, we introduce a 
new vector norm, as detailed next. 

Inspired by (l33l . we introduce the following norm: 



-{\\(*y 



I l.vec ' 



|(x)- 



ec J 7 

(35) 

| j vec denotes the l± norm BTI . Some properties of 
1 1 ' 1 1 1 oo vec are listed in the following lemma (we omit the proof 
because of space limitation). 

Lemma 5: The norm \\■\\^ „ in (I35l> is a valid vector 



where 



1 , oo ,vec 

norm (in the sense that it satisfies the axioms of a norm J4T 
Moreover, the following nonexpansion property holds true* 12 



Ox 1 - Xq) 



UV 



yo 



l,oo,vec 



< ||X0 - yo|| l!OC!Vec , 

Vx ,y eR^,(36) 



where the waterlevels \i x and fi v satisfy 1 T (fJ> x l — Xo) = 
1 T (/^yl — yo) = Pt, (with Pt an arbitrary nonnegative 
number), and 1 denotes the N -dimensional vector of all ones. 
□ 

Interestingly, Lemma [5] provides the nonexpansion property 
of the (single-user) waterfilling solution in the vector norm 
II '111 oo vec- I* a ^ so re P resen ts the key result to prove the 
contraction property of the multiuser waterfilling mapping wf 

12 Interestingly, one can prove that the nonexpansion property as stated in 
i36t also holds true if in B6t the norm [|'||, vcc is replaced by the l\ norm. 
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in ( fT6l ) in the block-maximum norm IHI™^ block associated to 
ll'lli.oo.vec defined, for each w = [wi, . . . ,wq] t > 0, as 

|| wf „(PV" 



iiwf( P )iir: 



oo, block 



max ■ 



I l,oo, vec 



(37) 



The main result is stated next (the proof is based on Lemma 
[5] and follows similar steps of that in [20, Appendix A]; see 
also Theorem in Section HV-B2b . 

Theorem 4 (Contraction property of mapping wf): Given 
w = [wi, . . . , wq] t > 0, the mapping wf defined in (1161 l is 
Lipschitz continuous on 3?: 



wf(p (1) ) - wf(p w ) 



(2)< 



< 



1, 00, block 



Cmax II w 

1 1 00, mat 

I (1) (2)|| w 

1^ r II l,oo, block ' 

(38) 



and S 1 



are 
□ 



Vp« p( 2 ) e where ||C, 
defined in (137b . i20l , and (12U . respectively 

Comparing Theorem [3] with Theorem |U one infers that 
both theorems provide the same sufficient conditions for the 
waterfilling mapping wf to be a contraction and thus the same 
sufficient conditions guaranteeing the uniqueness of the NE 
and the convergence of asynchronous IWFAs lfl9l . Il20l . 

B. MIMO Gaussian Interference Channels 

In this section, we generalize our interpretation of the water- 
filling projector in the frequency-selective case to the MIMO 
multiuser case. For the sake of simplicity, we concentrate on 
MIMO systems whose direct channel matrices H qq are square 
and nonsingular. The more general case is much more involved 
and goes beyond the scope of the present paper; it has been 
considered in ETI . 

1) Multiuser waterfilling in Gaussian MIMO interference 
channels : We first introduce the following intermediate result. 



- U H (R„ + HXH H j H + AI = *, (42) 

* b 0, X y 0, Tr(*X) = 0, (43) 

A>0, A (Tr(X) — P T ) = 0, Tr(X) < P T . (44) 

First of all, observe that A must be positive. Otherwise, (l42l) 
would lead to 



y H H ( R 



HXH 



H = * y 0, 



(45) 



which cannot be true. We rewrite now d42l-(l44li in a more 
convenient form. To this end, we introduce 



X ^ - (H^R- 1 !!)- 1 ^ 0, 



so that 

a H ^Rn 



HXH 



H 



H = (X - X ) >0. 



(46) 



(47) 



Then, using the fact that A > and absorbing in d42l-(l43l the 
slack variable system (I42l-(l44l can be rewritten as 



X 



(X-X ) 



AI 



0, 



(48) 

x y 0, - (x - x ) _1 + ai y 0, (49) 

A > 0, Tr(X) = P T , (50) 
where in (|48T > we have used the following fact ll38l fact 8.10.3] 

Tr(*X) = ^ *X = 0, 
Since A > 0, (|48]l-(|50]l become 



V*,Xb0. (51) 



X 



(X-Xo)--I 



= 0, 



x y 0, -(x-x ) +ai y o, 

A>0, Tr(X) = P T . 
We show now that <f52T>-(f54b is equivalent to 



(52) 

(53) 
(54) 



Proposition 1: Given R„ y 0, H G C™ x ", and P T > 0, 
let define the following two convex optimization problems: 



(PI): 



maximize log det ( R„ + HXH 
subject to Tr{X} < Pt, 



(39) 



and 



(P2) 



minimize 



HX-Xoll^. 
subject to Tr{X} = P T . 



(40) 



If Xo in (P2) is chosen as Xo = — (H ff R,^ 1 H) 1 , then both 
problems (PI) and (P2) have the same unique solution. 

Proof: Problem (PI) [and (P2)] is convex and admits a 
unique solution, since the objective function is strictly concave 
(and strictly convex) on X y 0. The Lagrangian function L 
associated to d39l is 



C = — log det ^R„ + 



HXH 



H 



-Tr(*X)+A(Tr(X) - P T ) , 
(41) 

which leads to the following KKT optimality conditions 
(Slater's conditions are satisfied ||42] Ch. 5.9.1]): 



X [(X - X ) + M I] = 0, (55) 
X yo, X - X + fjl h 0, (56) 
/i = free, Tr(X) = P T . (57) 

(El => d55>-d5V>: Let (X, A) be a solution of (|52]|-<|54]|. A 
solution of (l55ll-(l5"7Ti is obtained using (X, /i), with \i = — i-. 
d55]l-(l57ll => d52]i-(|54ll: Let (X, /i) be a solution of <|55]l-<|57]l. It 
must be fi < 0; otherwise, since X — X y [see (l46l)1. 05b 
would lead to X = 0, which contradicts the power constraint 
in fl57l ). Setting A = — — , it is easy to check that (X, A) satisfies 



The system (T55T>-(T57l> represents the KKT optimality con- 
ditions of problem (l40l with Xo defined in d46l >; which 
completes the proof. ■ 

Denoting by [Xo]^ the matrix projection of Xo with 
respect to the Frobenius norm onto the set £} q defined in 
©-the solution to problem (P2) in (|40j with P T = P g -and 
using Proposition Q] we have directly the following 

13 A more general expression of the waterfilling projection valid for the 
general case of singular (possibly) rectangular channel matrices is given in 

ED 
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Lemma 6: The waterfilling operator WF, (Q-q) in © 
can be equivalently written as 



WF, (Q_ g ) = [- (Hf,Rli(Q_,)H. 



L qqf 



(58) 



□ 



where J3 q is defined in Q. 

Comparing ( TTOb with d58l . it is straightforward to see that 
all the Nash equilibria of game $ can be alternatively obtained 
as the fixed-points of the mapping defined in ((58): 



Q* = [- (H*R_i(Q* )H 



qq) 



WqeQ. (59) 



Remark 5 - Nonexpansive property of the MIMO waterfilling 
operator. Thanks to the interpretation of MIMO waterfilling in 
(0 as a projector, one can obtain the following nonexpansive 
property of the waterfilling operator that will be used in the 
next section to derive the contraction properties of the MIMO 
waterfilling mapping. 

Lemma 7: Given q E tt, let denote the matrix pro- 

jection onto the convex set 2l q with respect to the Frobenius 
norm, as defined in (140l l. Then, satisfies the following 

nonexpansive property: 



[Y] 



< IIX-YIL, VX,YG 



(60) 
□ 



2) Contraction property of MIMO multiuser waterfilling: 
Building on Lemmas [6] and [7] we derive now sufficient 
conditions for the waterfilling mapping to be a contraction, 
under a proper norm. Our result is the natural extension of 
Theorem [3] to the MIMO case. 

As in the SISO case, we define first an appropriate block- 
maximum norm for the multiuser waterfilling mapping. Given 



WF(Q) = (WF,(Q_,)), efl : & h- B, 



(61) 



where 3, = £ x X ■ • • X =Sq, with £ q and WF 9 (Q_,) defined 
in (0 and ( 1581 . respectively, we introduce the following block- 
maximum norm on C" x n , with n = + . . . + tit q , defined 
as 



|WF(Q)|| 



F, block 



= max ■ 

960 



|WF 9 (Q_,) 



(62) 



where ||-|| F is the Frobenius norm and w = [w\, . . . ,wq] t > 
is any positive weight vector. Finally, let S G R^ x< ^ be the 
nonnegative matrix defined as 

rqi A f P (H^H^H^H.^) , if r ^ q, 

[ V \ 0, otherwise. { ' 

where p (A) denotes the spectral radius[3 of A. The con- 
traction property of the waterfilling mapping is given in the 
following theorem. 



14 The spectral radius p (A) of the matrix A is defined as p (A) = 
max{|A| : A £ cr(A)}, with <r(A) denoting the spectrum of A 1411 . 



Theorem 5 (Contraction property of mapping WF): 
Given w = [iui, . . . , wq] t > 0, the mapping WF defined in 
(1611 is Lipschitz continuous on J2: 



||WF(Q«) - WF(Q( 2 ))|| Fb]ock < ||S||- mat 

x iio (i) - o (2) ir 

* || W *4 Unblock ' 

(64) 

VQ (1) , Q (2) G =2, where \\-\\Jj,, ock , ||-|lro,mat and S are defined 
in ( 162b . ( I20l i. and ( I63l l. respectively, and J2 = £H\ x • ■ ■ x J2q, 
with M q given in ([5). Furthermore, if 

l|SC, mat <l, (65) 
then mapping WF is a block-contraction with modulus a = 

|S|loo,mat- 

Proof: The proof of the theorem in the general case of ar- 
bitrary channel matrices is quite involved 11211 . Here, we focus 
only on the simpler case in which the direct channel matrices 
{Hqqlggn are square and nonsingular. Under this assumption, 
according to Lemma [6] each component WF(Q_,) of the 
mapping WF can be rewritten as in (l58l . 

The proof consists in showing that the mapping WF 
satisfies d64]i, with a = ||S||™ mat . 



Given Q 



(i) 



Q 



G £ and Q< 2 ) 



(q, ( i\ Q^ 2) ) G 3, let define, for each q E fl, 



A 

e WF, - 




:< 


-WF, (QL 2 >) 


1 

F 


(66) 


A 

e q = 


Q 9 1} - 




F ' 




(67) 



Then, we have: 



ewF„ - 



(h^r-^q 



<l'l 



(2) )R 



Bq 



r 









(68) 



< 



'/'/ 



- (H^R ? 1 (Q { ?l)H. qq 



H-M^H rg (q«-q( 2 >)h? 9 

\r^q 

< £ P (H^H/H-, 1 ^) J (QW - Q 

r^q 



q I "-qq 



H 



(2: 



r^q 



(69) 

(70) 

(71) 
(72) 



VQ (1) , Q (2) G £ and \fq G fi, where: dSU follows from 
(Lemma d69l follows from the nonexpansive property of 
the projector in the Frobenius norm as given in d60t (Lemma 
0; (ITOl i follows from the nonsingularity of the channel matri- 
ces {H gg }; (T7TT > follows from the triangle inequality [41 1 and 
from EQ 



AXA 



H 



< A max (A H A) ||X|| F , 



(73) 
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where X = X H and A e 



and d72l follows from the 



definition of matrix S in < f63T >. 
Introducing the vectors 

ewF — [ewFu ■ ■ ■ j ewF Q ] T , and e = [ei,. 



, (74) 



with ewF, and e q defined in d66l l and d67l i. respectively, the 
set of inequalities in d72l ) can be rewritten in vector form as 



< e WF < Se, VQ (1) , Q (2) e B. 



(75) 



Using the weighted maximum norm ||-||™ vec defined in (|T9] > 
in combination with (fTBT l. we have 



^WFllocvec < II Ses ll oo.vec 



— Il^lloo.mat Halloo, vec ' 



(76) 

VQW,Q( 2 ) G £? and w > 0, where \\-\\Z,rmt is the matrix 
norm induced by the vector norm ||-||™ vec in ( fT9l l and defined 
in d20j l 1411 . Finally, using (|76l l and d62l i, we obtain, 



WF(Q (1) ) - WF(Q 



(2) 



F.block 

< iisir 



ewF 



Q(2)| 



oo,mat -« I IF. block : 

(77) 

VQ (1) ,Q (2) G £ and Vw > 0, which leads to a block- 
contraction for the mapping WF if ||S||™ mat < 1, implying 
condition 



V. Existence and Uniqueness of the NE 

Using results obtained in the previous section, we can 
now study game <S and derive conditions for existence and 
uniqueness of the NE, as given next. 

Theorem 6: Game §f always admits a NE, for any set of 
channel matrices and transmit power of the users. Further- 
more, the NE is unique if 

P(S)<1, (CI) 



where S is defined in 

Proof: According to the interpretation of the waterfilling 
mapping WF in (O as a projector (cf. Lemma[6]), the existence 
of a NE of game & is guaranteed by the existence of a solution 
of the fixed-point equation d59l ). Invoking Theorem [Tf a), the 
existence of a fixed-point follows from the continuity of the 
waterfilling projector d58l ) on =2, for any given set of channel 
matrices {H r q} r .qen (implied from the continuity of the 
projection operator [22, Proposition 3.2c] and the continuity 
of each Rl*(Q_ g ) on ii?-J ls r). and from the convexity and 
compactness of the joint admissible strategy set i20 

According to Theorem |TJb), a sufficient condition for the 
uniqueness of the NE of game Sf is that the waterfilling 
mapping WF in (O be a contraction with respect to some 
norm. It follows from Theorem [5] that WF is a block- 
contraction if condition d65l) is satisfied for some w > 0. 



l5 This result can be proved using 1371 Theorem 10.7.1]. 

16 According to 1321 Theorem 1], the existence of a NE of game @ can 
also be proved showing that the game is a concave game: 1) The set Sl q of 
feasible strategy profiles of each player q is compact and convex; and 2) The 
payoff function of each player q is continuous in Q £ J and concave in 
Qq 6 Sq, for any given Q_ g £ 3- q (this follows from the concavity of 
the log function 1421 ). 



Since S in d65l ) is a nonnegative matrix, there exists a positive 
vector w such that [22, Corollary 6.1] 



<1 p(S)<l, 



(78) 



which proves the sufficiency of (ICll l. 



To give additional insight into the physical interpretation of 
sufficient conditions for the uniqueness of the NE, we provide 
the following corollary of Theorem [6] 

Corollary 1: A sufficient condition for ( ICll l is given by one 
of the two following set of conditions: 

— 5> (Hf 9 H-«Hf g H rg ) w r < 1, V ? G n, (C2) 



r^q 



— Y^p (Hf H -"Hf ? H r9 ) wq < 1, Vr G il, 



w r 



(C3) 



q=fr 



where w = \w\ 1 . . . , wq] t is a positive vector. □ 

Remark 6 - Physical interpretation of uniqueness conditions. 
Looking at conditions ( IC2b - dC3b . it turns out, as expected, that 
the uniqueness of a NE is ensured if the interference among 
the links is sufficiently small. The importance of conditions 
( IC21 i- (lC31 l is that they quantify how small the interference 
must be to guarantee that the equilibrium is indeed unique. 
Specifically, condition ( !C2t can be interpreted as a constraint 
on the maximum amount of interference that each receiver 
can tolerate, whereas dC31 > introduces an upper bound on 
the maximum level of interference that each transmitter is 
allowed to generate. This result agrees with the intuition that, 
as the MUI becomes negligible, the rates of the users become 
decoupled and then the rate-maximization problem in <j4j for 
each user admits a unique solution. 

Remark 7 - Special cases. Conditions in Theorem [6] and 
Corollary Q] for the uniqueness of the NE can be applied to 
arbitrary MIMO interference systems0 irrespective of the 
specific structure of channel matrices. Interestingly, most of 
the conditions known in the literature iTTll- lfl?! for the rate- 
maximization game in SISO frequency-selective interference 
channels and OFDM transmission come naturally from (1CU as 
special cases. In fact, using the Toeplitz and circulant structure 



of the channel matrices H 



rq 



= WD r ,W fl 



(cf. Section 



HV-Ab , matrix S in the uniqueness condition dCU . defined in 
(l63l . reduces to matrix S max defined in (fJTJ, showing that 
our uniqueness condition coincides with those given in |[T9l . 
Il20l and enlarges those obtained in irTII-lfTll and H21, iTTSTl . 
Observe that condition dClT i. with S max defined in ( TUT ), can 
be further weakened by computing the "max" over a subset 
of {1, . . . , N}, obtained from {1, . . . , N} by removing the 
subcarrier indexes where each user will never transmit, for 
any set of channel realizations and interference profile |[T8l . 
fl9l . An algorithm to compute such a set is given in [18]. 

Recently, in OTI . the authors studied the game <S and proved 
using ll32l Theorem 2.2] that the NE of the game is unique 



17 Recall that here we have concent rated on square nonsingular (direct) 
MIMO channels. However, condition )CU can be generalized to the case 
of rectangular channel matrices |21|. 
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if the MUI at each receiver q, measured by the interference- 
to-noise ratios {P r /a^} r ^ q where a 2 q is the variance of the 
thermal noise at receiver q, is smaller than a given unspecified 
threshold. Differently from OTl . our results provide a set of 
sufficient conditions that can be checked in practice, since they 
explicitly quantify how strong the MUI must be to guarantee 
the uniqueness of the NE. 

VI. MIMO Asynchronous Iterative Waterfilling 
Algorithm 

According to the framework developed in Section [HI] to reach 
the Nash equilibria of game ( S, one can use an instance of 
the totally asynchronous scheme of ll22l (cf. Section IHI-Bl i. 
based on the waterfilling mapping ©, called asynchronous 
Iterative WaterFilling Algorithm (IWFA) 1 2 1 1 . In the asyn- 
chronous IWFA, all the users maximize their own rate in 
a totally asynchronous way via the single user waterfilling 
solution ©. According to this asynchronous procedure, some 
users are allowed to update their strategy more frequently 
than the others, and they might perform these updates using 
outdated information on the interference caused by the others. 
We show in the following that, whatever the asynchronous 
mechanism is, such a procedure converges to a stable NE of 
the game, under the same sufficient conditions guaranteeing 
the uniqueness of the equilibrium given in Theorem [6] 

To provide a formal description of the proposed asyn- 
chronous IWFA, we need the following preliminary defini- 
tions. We assume, w.l.o.g., that the set of times at which 
one or more users update their strategies is the discrete set 
T = N + = {0, 1,2,...}. Let Q q n) denote the covariance 
matrix of the vector signal transmitted by user q at the n- 
th iteration, and let T q C T denote the set of times n at 
which Q q n ^ is updated (thus, at time n ^ T q , Qg 1 ' is left 
unchanged). Let r^(n) denote the most recent time at which 
the interference from user r is perceived by user q at the n- 
th iteration (observe that T%{ri) satisfies < r*(n) < ri). 
Hence, if user q updates his own covariance matrix at the n- 
th iteration, then he chooses his optimal Q q n \ according to 
(0, and using the interference level caused by 

q(t«(t0) a ^q( t ?W) Q (r Y l(n)) 

^ <B)) ,...,Q^ W) ). (79) 

The overall system is said to be totally asynchronous if 
the following weak assumptions are satisfied for each q [22]: 
Al) < r«(n) < n; A2) Hindoo T q r {n k ) = +oo; and A3) 
\T q \ — oo; where {rife} is a sequence of elements in T q 
that tends to infinity. Assumption (A1)-(A3) are standard in 
asynchronous convergence theory 11221 . and they are fulfilled 
in any practical implementation. In fact, (Al) simply indicates 
that, in the current iteration n, each user q can use only 
interference vectors Q_ 9 ^ n '' allocated by others in previous 
iterations (to preserve causality). Assumption (A2) states that, 
for any given iteration index m, values of the components 
of Qi 7 " ^ in d79l l generated prior to m, will not be used 

(ri) 

in the updates of Q q after a sufficiently long time ri2\ this 
guarantees that old information is eventually purged from the 



system. Finally, assumption (A3) indicates that no user fails 
to update his own strategy as time n goes on. 

Using the above notation, the asynchronous IWFA is for- 
mally described in Algorithm 1. 



Algorithm 1: MIMO Asynchronous IWFA 

Set n = and Q g ^ = any feasible covariance matrix; 

for n = : N it 

( Qq, otherwise; 

(80) 

end 



It follows directly from Theorems [2] and [5] that convergence 
of the algorithm is guaranteed under the following sufficient 
conditions. 

Theorem 7: Suppose that condition (ICU in Theorem [6] 
is satisfied. Then, as N it — > oo, the asynchronous IWFA, 
described in Algorithm 1, converges to the unique NE of 
game Sf for any set of feasible initial conditions and updating 
schedule. □ 

Remark 8 - Global convergence and robustness of the algo- 
rithm. Even though the rate maximization game Sf and the 
consequent waterfilling mapping (0 are nonlinear, condition 
dCll ) guarantees the global convergence of the asynchronous 
IWFA. Observe that Algorithm 1 contains as special cases 
a plethora of algorithms, each one obtained by a possible 
choice of the scheduling of the users in the updating procedure 
(i.e., the parameters {^(n)} and {T q }). Two special cases 
are the sequential and the simultaneous MIMO IWFA, where 
the users update their own strategies sequentially and simulta- 
neously, respectively. The important result stated in Theorem 
|7] is that all the algorithms resulting as special cases of the 
asynchronous IWFA are guaranteed to reach the unique NE 
of the game, under the same set of convergence conditions 
(provided that (A1)-(A3) are satisfied), since conditions in 
iCli do not depend on the particular choice of {T q } and 
W(n)}. 

Remark 9 - Distributed nature of the algorithm. Since the 
asynchronous IWFA is based on the waterfilling solution ©, 
it can be implemented in a distributed way, where each user, to 
maximize his own rate, only needs to measure the covariance 
matrix of the overall interference-plus-noise and waterfill over 
this matrix. More interestingly, according to the asynchronous 
scheme, the users may update their strategies using a poten- 
tially outdated version of the interference and, furthermore, 
some users are allowed to update their covariance matrix 
more often than others, without affecting the convergence of 
the algorithm. These features strongly relax the constraints 
on the synchronization of the users' updates with respect to 
those imposed, for example, by the simultaneous or sequential 
updating schemes. 

Remark 10 - Well-known cases. The MIMO asynchronous 
IWFA, described in Algorithm 1 is the natural generalization 
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of the asynchronous IWFA proposed in [20], to solve the 
rate-maximization game in Gaussian SISO frequency-selective 
parallel interference channels. Algorithm in [20] can be in fact 
obtained directly from Algorithm 1 using the following equiv- 
alences: Q, p q , WF, (■) wfg (•) , and £} q <S4> 2? q , 
where WF, (•) , wf g (-), J> g , and S? q are defined in (0, 
dl6K ©, and (fT4l . respectively. Similarly, the well-known 
sequential IWFA |7)-|[14), [19) and simultaneous IWFA EJ- 
US), lfl9l proposed in the literature are special cases of 
Algorithm 1, using the above equivalences. 

VII. Numerical Results 

In this section, we first provide some numerical results 
illustrating the benefits of MIMO transceivers in the multiuser 
context. Then, we compare some of the proposed algorithms 
in terms of convergence speed. 

Example 1 - MIMO vs. SISO. MIMO systems have shown 
great potential for providing high spectral efficiency in both 
isolated, single-user, wireless links without interference or 
multiple access and broadcast channels. Here we quantifies, 
by simulations, this potential gain for MIMO interference 
systems. In Figure Q] we plot the sum-rate of a two-user 
frequency-selective MIMO system as a function of the inter- 
pair distance among the links, for different number of trans- 
mit/receive antennas. The rate curves are averaged over 500 
independent channel realizations, whose taps are simulated 
as i.i.d. Gaussian random variables with zero mean and unit 
variance. For the sake of simplicity, the system is assumed 
to be symmetric, i.e., the transmitters have the same power 
budget and the interference links are at the same distance 
(i.e., d rq = d qr , Vg,r), so that the cross channel gains are 
comparable in average sense. The path loss 7 is assumed to 
be 7 = 2.5. 

From the figure one infer that, as for isolated single-user 
systems or multiple access/broadcast channels, also in MIMO 
interference channels, increasing the number of antennas at 
both the transmitter and the receiver side leads to a better 
performance. The interesting result, coming from Figure Q] 
is that the incremental gain due to the use of multiple trans- 
mit/receive antennas is almost independent of the interference 
level in the system, since the MIMO (incremental) gains in 
the high-interference case (small values of d rq /d qq ) almost 
coincide with the corresponding (incremental) gains obtained 
in the low-interference case (large values of d rq /d qq ), at least 
for the system simulated in Figure [T] This desired property is 
due to the fact that the MIMO channel provides more degrees 
of freedom for each user than those available in the SISO 
channel, that can be explored to find out the best partition of 
the available resources for each user, possibly cancelling the 
MUI. 

Example 2 — Sequential vs. simultaneous IWFA. In Figure [2] 
we compare the performance of the sequential and simultane- 
ous IWFA, in terms of convergence speed, for a given set of 
MIMO channel realizations. We consider a cellular network 
composed by 7 (regular) hexagonal cells, sharing the same 
spectrum. Hence, simultaneous transmissions of different cells 
can interfere with each other. The Base Stations (BS) and the 



n T = n R= 4 

n =n =2 




d /d 

rq qq 

Fig. 1. Sum-Rate of the users versus d rq /d qq ; d rq = d qr , d rr — 
d vq = 1, r = 1,2, Q = 2, 7 = 2.5, Pi/ai = P 2 /al = 5dB, 
L h = 6, N = 16. 



Mobile Terminals (MT) are equipped with 4 antennas. For 
the sake of simplicity, we assume that in each cell there is 
only one active link, corresponding to the transmission from 
the BS (placed at the center of the cell) to a MT placed in 
a corner of the cell. According to this geometry, each MT 
receives an useful signal that is comparable in average sense 
with the interference signal transmitted by the BSs of two 
adjacent cells. The overall network is thus stitched out of eight 
4x4 MIMO interference wideband channels, according to (HJ. 

In Figure [2] we show the rate evolution of the links of three 
cells corresponding to the sequential IWFA and simultaneous 
IWFA as a function of the iteration index n . To make the 
figure not excessively overcrowded, we plot only the curves 
of 3 out of 8 links. As expected, the sequential IWFA is slower 
than the simultaneous IWFA, especially if the number of active 
links Q is large, since each user is forced to wait for all the 
users scheduled in advance, before updating his own power 
allocation. The same qualitative behavior has been observed 
changing the channel realizations and the number of antennas. 
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Fig. 2. Rates of the links versus iterations: sequential IWFA (dashed 
line curves) and simultaneous IWFA (solid line curves); Q = 8, 
7 = 2.5, P q /a 2 q = 5dB, Vg £ fi, L h = 6, TV = 16. 
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VIII. Conclusions 

In this paper we have considered the competitive maximization 
of mutual information in noncooperative interfering networks 
in a fully distributed fashion, based on game theory. We 
have provided a unified view of main results obtained in 
the past seven years, showing that the proposed approaches, 
even apparently different, can be unified by our interpretation 
of the watefilling solution as a proper projection onto a 
polyhedral set. Building on this interpretation, we have shown 
how to apply standard results from fixed-point and contraction 
theory to the rate maximization game in SISO frequency- 
selective channels, in order to obtain a unified set of sufficient 
conditions guaranteeing the uniqueness of the NE and the 
convergence of totally asynchronous distributed algorithms. 
The proposed framework has also been generalized to the 
(square) MIMO case. The obtained results are the natural 
generalization of those obtained in the SISO case. 
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